UPM system for WMT 2012
نویسندگان
چکیده
This paper describes the UPM system for the Spanish-English translation task at the NAACL 2012 workshop on statistical machine translation. This system is based on Moses. We have used all available free corpora, cleaning and deleting some repetitions. In this paper, we also propose a technique for selecting the sentences for tuning the system. This technique is based on the similarity with the sentences to translate. With our approach, we improve the BLEU score from 28.37% to 28.57%. And as a result of the WMT12 challenge we have obtained a 31.80% BLEU with the 2012 test set. Finally, we explain different experiments that we have carried out after the competition.
منابع مشابه
UPM system for the translation task
This paper describes the UPM system for translation task at the EMNLP 2011 workshop on statistical machine translation (http://www.statmt.org/wmt11/), and it has been used for both directions: Spanish-English and English-Spanish. This system is based on Moses with two new modules for pre and post processing the sentences. The main contribution is the method proposed (based on the similarity wit...
متن کاملDFKI's SMT System for WMT 2012
We describe DFKI’s statistical based submission to the 2012 WMT evaluation. The submission is based on the freely available machine translation toolkit Jane, which supports phrase-based and hierarchical phrase-based translation models. Different setups have been tested and combined using a sentence selection method.
متن کاملJoint WMT 2012 Submission of the QUAERO Project
This paper describes the joint QUAERO submission to the WMT 2012 machine translation evaluation. Four groups (RWTH Aachen University, Karlsruhe Institute of Technology, LIMSI-CNRS, and SYSTRAN) of the QUAERO project submitted a joint translation for the WMT German→English task. Each group translated the data sets with their own systems and finally the RWTH system combination combined these tran...
متن کاملThe RWTH Aachen Machine Translation System for WMT 2012
This paper describes the statistical machine translation (SMT) systems developed at RWTH Aachen University for the translation task of the NAACL 2012 Seventh Workshop on Statistical Machine Translation (WMT 2012). We participated in the evaluation campaign for the French-English and German-English language pairs in both translation directions. Both hierarchical and phrase-based SMT systems are ...
متن کاملProposing a suitable soil quality index for natural, secondary and rehabilitated tropical forests in Malaysia
Proposing a suitable soil quality index for natural, secondary and rehabilitated tropical forests in Malaysia A. Arifin*, D. S. Karam, J. Shamshuddin, N. M. Majid, O. Radziah, A. H. Hazandy and I. Zahari Department of Forest Production, Faculty of Forestry, Universiti Putra Malaysia, 43400 UPM Serdang, Selangor, Malaysia. Laboratory of Sustainable Bioresource Management, Institute of Tropical F...
متن کامل